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FIELD OF THE INVENTION 

This present invention relates to digital circuits. More specifically, it relates to 
delay locked loop design. 



BACKGROUND OF THE INVENTION 

Accurate high-speed clock signals are often used for transmitting and receiving 
data in high-speed circuits, such as high-speed memory devices or high speed bus 
channels. However, high speed clock signals often become distorted during transmission 

10 and reception. Consequently, high speed circuits, such as dynamic random access 
memory (DRAM) devices, often recover an externally provided high-speed clock signal 
by locking an accurate internally generated clock signal to the distorted externally 
provided clock signal. 

A delayed locked loop (DLL) is typically used to delay the internally generated 

15 clock signal in order to match the phase of the internally generated clock signal to the 
phase of some reference clock signal. Typically, a phase-detection circuit in the DLL 
compares the phase of the internal clock signal to the reference signal and a control-logic 
block that is coupled to the output of the phase-detection circuit is used to increase or 
decrease a delay produced by a chain of delay elements used to delay the internal clock 

20 signal U.S. Patent Nos. 5,945,862 and 6,125,157 to Donnelly et aL represent two 
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standard digital design techniques. 
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SUMMARY OF THE INVENTION 

In accordance with a first aspect of the present invention, a circuit for selectively 
delaying a reference clock signal is provided. The circuit includes a phase splitter 
coupled to a first delay line and a second delay line. Each delay line includes a set of 
5 output taps and has a multiplexor that is selectively coupled to the output taps. An output 

m*. - »^ 

of each multiplexor is coupled to a latch. The latch provides an output clock signal, 
which may be fed back to a phase detector, where the phase detector also receives an 
input clock signal. A controller is coupled to the phase detector and, depending upon the 
output of the phase detector, provides control signals to each multiplexor. 
10 In a preferred embodiment, the circuit is implemented using standard digital 

design techniques. 

In accordance with a second aspect of the present invention, a method for 

recovering a clock signal from an input clock signal is provided. The method includes 

converting a reference clock signal into an in-phase reference and a complementary 

15 reference, and selectively delaying the in-phase reference and the complementary 

reference. The method further includes generating an output clock signal from the 

selected in-phase reference and the selected complementary reference. 

In accordance with the preferred embodiment, the selective delay of the in-phase 

reference and the complementary reference is~based upon a difference signal determined 

20 by comparing an input clock signal with a feedback clock signal, where the feedback 

clock signal is related to the output clock signal. 

-4- 



MdQonneli Socmen Huftert & Bergtofl 
300 S. W acker Drive, Floor 3200 
CttcagoJL 60606 
312-013-0001 



In accordance with another embodiment, the method is implemented using digital 
design techniques. 

This summary is not intended to be all-inclusive, but rather illustrative. These 
and other aspects of the present invention, and its various embodiments, are described in 
greater detail below. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Preferred embodiments of the present inventions are described with reference to 
the following drawings, wherein: 

FIG. 1 is a functional block diagram of one embodiment of a DLL circuit 
5 according to the present invention; 

FIG. 2 is a timing diagram illustrating an example of the function of the DLL 
circuit of FIG. 1; 

FIG. 3 is a control flow diagram illustrating an embodiment of the function 
performed by the controller of FIG, 1; 
10 FIG. 4 is a logic circuit diagram illustrating an embodiment of a circuit for 

generating a quadrature clock according to one aspect of the present invention; 

FIG. 5 is a control flow diagram illustrating an embodiment of the function 
performed by a controller to generate control signals to obtain the quadrature clock signal 
from the circuit of FIG. 4; 
15 FIG. 6 is a timing diagram illustrating an example of the function of the 

quadrature clock circuit of FIG. 4; 

FIG. 7 is a functional block diagram of another embodiment of a DLL circuit 
according to the present invention; 

FIG. 8 is a functional block diagram illustrating an example of a delay element 
20 suitable for use in the delay line circuits of FIG. 7; 
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FIG. 9 is a control flow diagram illustrating an embodiment of the function 
performed by the controller of FIG. 7; 

FIG. 10 is a functional block diagram illustrating an example of a circuit for 
detecting a cycle boundary in the delay line circuits of FIG. 7; 
5 FIG. 1 1 is a transistor circuit diagram which illustrates an example of a prior-art 

tri-state inverter circuit configured to be enabled by a differential enable signal that may 
cause a multiplexor circuit which uses it to experience a data dependent response; 

FIG. 12 illustrates a circuit that addresses the problem of data dependent response 
exhibited by the circuit of FIG. 11; 
10 FIG. 13 is a functional block diagram illustrating an example of a two level 

multiplexor design suitable for use with the circuits of FIG. 1 and FIG. 7; and 

FIG. 14 is a logic circuit diagram illustrating an embodiment of a RS flip-flop 
suitable for use in the described embodiments. 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 



In one preferred embodiment, a reference clock having a correct duty cycle, e.g. 
50%, is split into an in-phase clock reference signal and a complementary reference clock 
signal, each of which is successively delayed using a pair of delay lines. A set of output 

5 taps on the delay line allows the selection of and variation in the length of the delay. A 
pair of multiplexors (MUXes) is used to select one output tap from each of the delay 
lines. In accordance with one preferred embodiment, a rising edge of a signal from the 
output tap selected from the delay line used to delay the in-phase reference signal is used 
to produce a rising edge in an output clock signal. Then, a rising edge of a signal from 

10 the output tap selected from the delay line used to delay the complementary reference 
signal is used to produce a falling edge in the output clock signal. A feedback clock 
signal is compared to the input clock signal in order to obtain a difference signal that 
reflects the phase relationship between the feedback clock signal and the input clock 
signal. A control circuit monitors the difference signal and generates a control signal that 

15 drives the selection performed by the MUXes such that a rising edge of the feedback 
clock is aligned or lock-in to a rising edge of the input clock signal. 

In another aspect of the preferred embodiment, a power control signal controls 
delay elements within the delay lines and the control circuit is configured to generate the 
power control signal based upon the control- signal provided to the MUXes such that 

20 unused elements in the delay lines are disabled during operation to reduce power 
consumption. 
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In still another aspect of the present invention, four additional MUXes are used to 
select output taps from each of the delay lines in order to produce a quadrature clock 
signal. The additional outputs taps are selected such that they are offset from the 
previously selected output taps by an offset value, where one offset output tap is 

5 advanced by the offset value and the other offset output tap is delayed by the offset value. 
The additional output taps are interpolated in order to generate the quadrature clock. A 
quadrature control circuit varies the offset value and monitors the relationship between 
edges in the signals from the offset output taps in order to drive the additional pairs of 
MUXes. The quadrature control circuit selects an offset value that results in the edges 

10 from the offset output taps being generally aligned. 

FIG. 1 is a functional block diagram of one embodiment of a delay locked loop 
("DLL") circuit according to the present invention. A reference clock signal REFCLK is 
received by a phase sp Utter 102, which splits the REFCLK into a positive phase reference 
signal RP (also referred to as an in-phase reference signal) and a negative phase reference 

15 signal RN that is 180° out of phase with RP (also referred to as the complementary 
reference signal). The positive phase reference RP is input to a delay line 110, which has 
N output taps that are input to a multiplexor (MUX) 1 12. The MUX 112 also receives a 
CONTROL signal that selects one of the N inputs from the delay line 110 to be output 
from the MUX 112 as a ZUP signal to set an S input of a Set/Reset latch 130. The delay 

20 line 110 successively delays the reference signal RP such that each one of the N output 
taps represents RP delayed by a multiple of a predetermined delay interval. Thus, 
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selecting one of the N output taps from the delay line 1 10 using the MUX 1 12 permits the 
RP signal to be delayed by from 1 to N delay intervals before being input to the S 
terminal of the latch 130. 

Likewise, the delay line 120 delays the negative phase reference signal RN by 

5 from 1 to N delay intervals, as selected by the MUX 122. The delay line 120 receives 

•• - ^ 

RN and has N output taps that are input to the MUX 122. The MUX 122 also receives 
the CONTROL signal, which determines which of the N output taps of the delay line 120 
to output as a ZDOWN signal to a reset input R of the latch 130. The MUX 122 is 
configured to respond to the CONTROL signal in the same fashion to the response of the 

10 MUX 112. In other words, the delay line 120 and the MUX 122 are configured relative 
to the delay line 110 and the MUX 112 such that the ZUP and ZDOWN signals are 
complementary to one another in generating signals responsive to the CONTROL signal 
output by controller 140. The ZUP and ZDOWN signals control the up/down edges of 
the output signal. The output signal drives some collection of circuitry to produce a 

15 signal FB_CLK that is aligned in zero degrees in phase with the input clock signal. The 
resulting output clock signal CLKOUT generated by latch 130 is a clock signal whose 
edges are determined by the CONTROL signal and which has a 50% duty cycle. 

A phase detector circuit 142 receives a feedback clock signal FB_CLK derived 
from the CLKOUT signal and also receives an input clock signal INPUT CLK that is the 

20 signal to which the circuit 100 is to lock onto. The CLK_OUT may move in time in a 
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manner that the resulting phase relationship of the FB_CLK relative to the INPUT CLK 
is well managed. 

The phase detector 142 compares the FB_CLK and the INPUT CLK and outputs 
a difference signal DIFF that indicates whether the phase of FB_CLK leads or lags the 

5 phase of INPUT CLK. The DIFF signal is received by the controller 140, which adjusts 
the CONTROL signal accordingly. As one example for the control circuit 140, the 
circuit may consist of a counter, which is incremented or decremented by the DIFF signal 
in order to adjust the CONTROL signal. It should be understood that someone skilled in 
art of PLL or DLL design will recognize that the circuit which provides the control signal 

10 to the VCO or delay-line of a DLL can be implemented in many fashions, and other 
circuits of varying complexity could also be used. 

FIG, 2 is a timing diagram illustrating an example of the function of the DLL 
circuit of FIG. 1. In FIG. 2, the reference clock signal REFCLK is shown along with the 
reference clock signals RP and RN output from the phase splitter 102. Note that 

15 REFCLK has a 50% duty cycle. Also note that RP and RN are complementary to one 
another, e.g. 180° out-of-phase with one another and that a delay is introduced to RP and 
RN relative to REFCLK by the phase splitter 102. 

FIG. 2 also includes an example of an external input clock signal INPUT CLK. 
Note that, in this example, the INPUT CLK signal waveform is distorted due to the 

20 effects of transmission and, consequently, the received INPUT CLK waveform does not 

have a 50% duty cycle. In one embodiment, phase detector 142 compares the phase of 
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rising edge 150 of INPUT CLK to the FB CLK, which is derived from CLK OUT. 
Controller 140 receives the DEFF signal and adjusts the value of the CONTROL signal in 
order to select a delay tap DP(n) from delay line 110 that aligns the rising edge of FB 
CLK with the rising edge 150 of INPUT CLK. In this example, DP(n) is selected such 

5 that rising edge 152 of DP(n) is aligned with rising edge 150 of INPUT CLK. The rising 
edge 152 of DP(n), in turn, drives the set input of the latch 130 triggering the rising edge 
154 of the CLK OUT signal. 

The CONTROL signal will also select DN(n), from the delay line 120, for output 
by the MUX 122 to the reset terminal of the latch 130. DP(n) and DN(n) are generally 

10 complementary to one another, but are typically distorted by their respective delay lines 
110 and 120 such that neither DP(n) nor DN(n).will have a correct 50% duty cycle. 
However, the distortion will effect the rising edges of DP(n) and DN(n) in the same 
manner. Therefore, while the duty cycles of DP(n) and DN(n) are distorted, the 
relationship between the rising edge 152 of DP(n) and the rising edge 160 of DN(n) is 

15 reliable. Thus, in the present invention, rising edge 160 of DN(n) is used to drive the 
reset input of the latch 130 triggering a falling edge 162 of the CLK OUT. The cycle 
then continues as demonstrated by next subsequent a rising edge 170 of DP(n) that 
triggers a rising edge 172 of the CLK OUT. The result is that the circuit 100 of FIG. 1 
may produce a CLK OUT signal having a 50% duty cycle. 

20 Note that the timing diagram of FIG. 2 has been simplified to demonstrate the 

relationship between edges in the various waveforms. The actual waveforms are subject 
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to further delays introduced, for example, by the MUXes 112 and 122, the latch 130, as 
well as other circuitry that is intermediate to the CLK OUT signal and the FB CLK 
signal. 

As noted above, the controller 140 may be, for example, a counter, or composed 

5 of complex circuitry, such as a processor, that monitors the output of the phase detector 

— - ~, 

142 and generates the CONTROL signal. FIG. 3 is a control flow diagram illustrating an 
embodiment of a process 180 performed by the controller 140 for generating the 
CONTROL signal. In the process 180, at step 182, an initial value of n is selected and, at 
step 184, the CONTROL signal for controlling the delay of delay lines 110 and 120 is 

10 generated according to the initial value of n selected at step 1 82. 

At step 186, the output of the phase detector. 142 is checked to determine if signal 
lock has been achieved between FB CLK and INPUT CLK, e.g. the rising edge of FB 
CLK is aligned with the rising edge of INPUT CLK. If signal lock is achieved, then 
control flow branches at step 190 to step 194, where the process waits for a short time 

15 interval before branching back to step 186 to check the output of the phase detector 182 
again. 

If signal lock is not achieved, then control flow branches at step 190 to step 192 

where the value of n is adjusted according to the phase detector output. For example, if 

the DIFF signal is negative, then n is decremented. If the DIFF signal is positive, then n 

20 may be incremented. Process 180 then branches back to step 184, where the CONTROL 

signal is generated according to the adjusted value of n. 
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Note that other approaches are also possible. For example, the signal lock branch 
step 190 and wait step 192 may be omitted so that the value of n is always adjusted. In 
this case, when signal lock is achieved, the value of the DEFF signal will toggle between 
positive and negative on each loop through the process. Control for DLLs is well 

5 understood in the art and one of ordinary skill will readily appreciate that the circuit of 
FIG. 1 may be adapted to work with a variety of approaches to DLL design. 

In another embodiment, a quadrature clock signal may also be generated. A 
quadrature clock is a clock that is out-of-phase by 90° or 270° from a main clock signal. 
Figure 4 illustrates an embodiment of a quadrature clock generation circuit 200 that may 

10 be used in combination with the circuit of FIG. 1. The quadrature clock generation 
circuit 200 includes a MUX 210 that receives N output tap lines of the delayed signal DP 
from the delay line 110, and is controlled by a control signal XDPCTRL1, which selects 
one of the taps DP(n+X) for output to a first input of an interpolator circuit 262. The 
interpolator circuit 262 illustrated in FIG. 4 is composed of three inverters 212, 220 and 

15 232. Interpolator circuits are generally understood to those of ordinary skill in the art, 
and other interpolator designs may be adapted for use in the present invention. The 
circuit 200 further includes a second MUX 230, which receives the N output tap lines of 
the delayed signal DN from the delay line 120 and a control signal XDNCNTRL1, which 
selects one of the taps DN(n-X) for output -to a second input of the interpolator circuit 

20 262. 
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The interpolator circuit 262 interpolates between a rising edge of the DP signal 
and the rising edge of the DN signal, i.e., it takes an average of the 0-degree signal and 
the 180-degree signal to generate a 90-degree QUP signal. To do that, the interpolator 
262 uses a late version of the DP signal and an early version of the DN signal, i.e., the 
signals DP(n+X) and DN(n-X), respectively. 

The inverter 212 of the interpolator circuit 262 receives the DP(n+X) signal, and 
the inverter 232 receives the DN(n-X) signal. The outputs of both inverters are coupled 
to the input inverter 220, which possesses a parasitic capacitance. The inverters 212 and 
232 are relatively small devices relative to the inverter 220. For instance, the inverter 
200 may be twice the size of the inverters 212 and 232. The inverters 212 and 232 will 
charge the parasitic capacitance of the input of the inverter 220 until a threshold voltage 
for the inverter 220 is reached causing its output to change state in order to generate a 
QUP signal to set an S input of an RS latch 260. When the signals driving the inverters 
212 and 232 are separated in time by less than the time it takes for the inverter 212 to 
fully transition the output of the inverter 220, then the inverter 232 contributes to the 
voltage transition at the output of the inverter 220. Thus, the interpolator circuit 262 will 
interpolate between a late version of DP signal and an early version of the DN signal in 
order to obtain a 90-degree QUP signal. 

Further, as illustrated in FIG. 4, the circuit 200 includes a second pair of MUXs 
244 and 246, and a second interpolator circuit 264 composed of inverters 248, 250, and 

252 that generate a 270-degree QDOWN signal. The MUX 244 receives the N output tap 
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lines of the delayed DP signal from the delay line 110, and is controlled by a control 
signal XDPCNTRL2, which selects one of the taps DP(n-X) for output to the second 
interpolator circuit 264. Similarly, the MUX 246 receives the N output tap lines of the 
delayed signal DN from the delay line 120 and control signal XDNCNTRL2, which 

5 selects one of the taps DN(x+N) for output to the interpolator circuit 264. 

To generate the QDOWN signal, the interpolator circuit 264 takes an average of 
the 180-degree signal and the 0-degree signal (or 360-degree signal) to generate a 270- 
degree QDOWN signal. To do that, the interpolator 264 uses an early version of the DP 
signal and a late version of the DN signal, i.e., the signals DP(n-X) and DN(n+X), 

10 respectively. The QDOWN signal is then input to a reset input R of the latch 130, and 
the latch 130 outputs a quadrature clock Q_CLK_OUT that is out-of-phase by 90 degrees 
or 270 degrees from the main clock. 

The circuit 200 further includes a quadrature control circuit 240 that generates the 
four control signals XDPCNTRL1, XDNCNTRL1, XDPCNTRL2, and XDNCNTRL2 

15 that are input to the MUXes 210, 230, 244, and 246, respectively. The quadrature control 
circuit 240 receives a TEST signal along with the value n from the CONTROL signal of 
the control circuit 140 of FIG. 1 and uses the values of these signals to generate the four 
control signals for the MUXes. The value of the TEST signal qualifies the value of the 
offset signal X used by the quadrature control circuit 240. As illustrated in FIG. 4, a 

20 phase detection circuit, such as a D flip-flop 242, generates the TEST signal The 

DP(n+X) signal selected at the MUX 210 drives a data input terminal of the D flip-flop 
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242, while the DN(n-X) selected at the MUX 230 drives a clock input terminal of the D 
flip-flop 254. A Q output terminal of the flip-flop 254 generates the TEST signal for 

* 

input to the quadrature control circuit 240. 

According to one embodiment, starting with a small value of X and incrementing 

5 it, the flip-flop 242 will change the state from 1 to 0 at some value of X that is used to 
generate the QUP and QDOWN signals. Similarly, a large value of X could be first 
selected and then decremented until the transition occurs. Alternatively, the DN(n-X) 
signal could be input to the D input, and the DN(n+X) could be then input to the clock 
input of the D flip-flop 242. In such an embodiment, the selected value of X corresponds 

10 to a value of X when the flip-flop 242 changes from 0 to L Thus, the phase detection 
circuit detects when the first input is no longer ahead of the second input, at which point 
the step of incrementing/decrementing the value of X is stopped, and the value of X can 
be used as an input to the quadrature control circuit 240. 

The function of the interpolator of FIG- 4 is further explained below in 

1 5 conjunction with FIG. 6. 

In FIG. 4, a change in state of the TEST signal from the Q output terminal of the 
flip-flop 242 indicates the midpoint between the two signals being interpolated. FIG. 5 is 
a flow chart illustrating one embodiment of a process 250 performed by the quadrature 
control circuit 242. The process 250 starts- from a minimum value for X, which is 

20 initialized at step 252. The quadrature control circuit 240 then generates control signals 

at step 254 based upon DP(n+X) and DN(n-X), respectively. The functionality of the 
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control circuit 240 may be integrated with some embodiments of the control circuit 140 
of FIG. 1. 

The value of the TEST signal is checked at step 256 to determine if the TEST 
signal has transitioned from logic one to logic zero. If the TEST signal is at logic zero, 
then the midpoint may have not been reached, and the control flow branches to step 262, 
where the value of X is incremented. The control flow then branches back to step 254 
where a new set of control signals is generated based on the incremented value of X. The 
value of the TEST signal is again verified at step 256. If the TEST signal has transitioned 
to the logic zero, then the control flow branches at step 260 to step 264, where the value 
of X is decremented. The value of X is decremented because the transition to logic zero 
indicates that the midpoint has been passed and the value of X is decremented, in this 
embodiment, to obtain a more accurate midpoint value. The value of X is now set for 
operation, and the QUP and QDOWN signals are generated. As mentioned in the 
preceding paragraphs, instead of setting the value of X to a minimum value, the value of 
X may be set to a maximum value and then decremented until the transition is detected. 

Interpolation of the 90-degree quadrature clock signal is illustrated in the 
waveform of the timing diagram of FIG. 6. The edges produced by the DP(n+X) and the 
DN(n-X) are shown as the value of X is successively incremented (or decremented) until 
the two edges converge at 90-degrees with the result being a QUP signal. 

As noted above, a 270-degree clock signal may alternatively be generated. 

Whereas the 90-degree QUP signal was derived by interpolating a midpoint that is 
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delayed from DP(n) and advanced from DN(n), e.g., taking a late DP signal and an early 
DN signal, e.g., using DP(n+X) and DN(n-X), a 270-degree QDOWN signal can be 
derived from an early version of DP signal and a late version of the DN signal, e.g., 
DP(n-X) and DN(n+X). The interpolation of the 270-degree QDOWN signal is also 

5 illustrated in the waveforms of the timing diagram of FIG. 6. 

In accordance with another embodiment, unused delay elements of the delay lines 
may be powered down to save power. For example, once the value of n is determined, 
some or all of the subsequent delay elements in the delay line may be turned off. FIG. 7 
is a functional block diagram illustrating another embodiment of a DLL circuit 300 

10 according to the present invention. The DLL circuit 300 of FIG. 7 is similar to the DLL 
circuit 100 of FIG. 1, except that delay lines 310 and 320 are composed of delay elements 
that may be powered down and a controller circuit 340 is adapted to generate a POWER 
CONTROL signal for powering down elements of the delay lines 310 and 320. In this 
embodiment, the controller 340 is obviously more complex than an up/down counter. 

15 FIG. 8 illustrates one example of a delay element circuit 350 that can be powered 

down and is suitable for use in the delay lines 310 and 320 of FIG. 7 and FIG. 1. The 
circuit 350 is a differential circuit that receives and outputs complementary signals. 
Inputs IN and INB receive the delayed signal output from a previous delay element in a 
sequence of the delay line. Outputs NEXT -and NEXTB are output, for example, to the 

20 next delay element in the sequence of the delay line. OUT and OUTB are output tap 

signals that are output to the MUXes 112 and 122. The delay for the delay element 
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circuit 350 is primarily provided by NAM) gates 360 and 362. Inverters 356 and 358 
primarily help keep the delay pass complimentarily. An ENABLE signal is input to each 
of the NAND gates 360 and 362. When the ENABLE signal is at a logic one, then the 
input signal at input terminals IN and INB is enabled to pass through the delay element to 
5 the next delay element and to the output terminals OUT and OUTB. When the ENABLE 
signal is at a logic zero, then the input signal at input terminals IN and INB is blocked. 
The NAND and inverter gates may be selected such that no current is drawn when no 
data transitions take place, e.g. complementary metal-oxide semiconductor (CMOS) 
logic. 

10 The POWER CONTROL signal output by the control circuit 340 separately 

drives the ENABLE line of the delay elements in the delay lines 310 and 320. Once the 
value of n is determined to align the DLL circuit with the INPUT CLK signal, then 
unused delay elements may be disabled to reduce power consumption. 

FIG. 9 is a control flow diagram illustrating an embodiment of a process 370 

15 performed by control circuit 340 for determining the value of n and setting the POWER 
CONTROL signal. Process 372 is similar to process 180 of FIG. 3. At step 372 (step 182 
in FIG. 3) of process 370, the control circuit 340 initializes the value of n, and the 
POWER CONTROL signal is initialized such that all elements of delay lines 310 and 320 
are active. At step 374 (step 184 in FIG. 3), the delay CONTROL signal is generated that 

20 drives the MUXes 1 12 and 122 of the delay circuit 300 to select the output taps from the 

delay lines 310 and 320. At step 376 (step 186 in FIG. 3), the output from a phase 
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detector is checked to determine whether the phase of the FBCLK signal lags or leads the 
phase of the INPUT CLK signal. If the circuit 300 has not locked to the INPUT CLK 
signal, then control flow branches at step 380 to step 188, where the control circuit 340 
adjusts the value of n according to the signal output by the phase detector 142 and the 
5 control flow returns to step 384 where the delay CONTROL signal is generated using the 
new value of n. 

If the circuit 300 has locked onto the INPUT CLK signal, then the control flow 

branches at step 380 to step 382, where the control circuit 340 determines a power control 

position that may be used to adjust the POWER CONTROL signal to turn off unused 
10 delay elements in the delay lines 310 and 320. At step 384, the control circuit 340 adjusts 

the power control signal to turn off unused elements, and the adjustment is based on the 

power control position determined at step 382. 

According to an alternative embodiment, the N delay stages ahead of the 

maximum delay stage of interest may be powered-down. The power control circuit 
15 monitors the ZLOOP and QLOOP delay line usage and applies a power down signal to 

positions "MAX+N," where "MAX" is the highest position being used, and N is 1 and 

greater. 

In a DLL circuit having a large delay line, an entire cycle of the reference clock 

frequency might be contained within the delay line. For instance, each step of a 64-bit 

20 delay line might be 50pS, thus, producing a delay of 3.2nS on the entire line. In such an 

embodiment, if the reference clock has a frequency of 500 MHz, the entire reference 
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clock cycle might be represented in 2.0nS, or 40 stages, of that delay line, leaving 24 
stages of the line unused. Therefore, it might be advantageous to detect a cycle 
boundary, where a DLL circuit crosses over from a maximum delay value to a minimum 
delay value in locking onto a signal. 

5 FIG. 10 illustrates an example of a circuit 400 for detecting a cycle boundary and 

generating a CYCLE BOUNDARY signal that may be received by a control circuit 
adapted, for example, to adjust the CONTROL signal to the MUXes 112 and 122 or the 
POWER CONTROL signal accordingly. In the circuit 400, a delay line 410 having a 
series of output taps receives the REFCLK signal. A first delay output tap Delay(O) is 

10 output to a MUX 412, and a maximum delay output tap Delay(N-l) is output to a MUX 
414. An output of the MUX 412 drives a buffer 416 whose output is received by a phase 
detector 420. Likewise, an output of the MUX 414 drives a buffer 418 whose output is 
also received by the phase detector 420. An output of the phase detector 420 is received 
by a reference control 430 that generates the CYCLE BOUNDARY signal. 

15 In one embodiment, the phase detector 420 receives a first output of the delay line 

410, a bit position (0), and finds another point on the delay line 410 that matches the bit 
position (0) in phase. To do that, the phase detector 420 may compare other bit positions, 
for example sequentially and in order, until the phase detector 420 finds a match. For 
instance, the phase detector 420 might check a position N to determine if the bit position 

20 N is ahead or after the bit position (0), and, if it is not ahead, the phase detector 420 may 

instruct reference control 430 to increment the bit position to N+l. At some point, if the 
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reference frequency is fast enough, the phase detector 420 detects a point where the bit 
position "N" happens before the bit position "0". At that point, the phase detector 420 
determines that one-cycle of the reference clock takes N stages of the delay line 410. 

According to one embodiment, the cycle boundary number N determined using 

5 the cycle boundary detector circuit 400 may be fed back to manage the primary loop 
circuit 140 described in reference to Figure 1. During a normal operation, the circuit 140 
may be arranged to increment its bit position by one bit position every hour in order to 
track a slowly drifting reference signal. After some time, a loop may approach a last bit 
position, such as a bit position 64 of a 64-stage delay line, for instance. Instead of 

10 approaching the 64th stage of the delay line, at which point the loop becomes unlocked, 
and the data might be lost while the loop re-acquires, the loop might utilize the number N 
determined by the cycle boundary detector circuit of Figure 10. Using the value of N, the 
loop may determine that the bit position "64-N" may be considered as the bit position 64. 
Therefore, for instance, if N=40, instead of moving to position 64 on an "increment" step, 

15 the controller may move to position 24. Alternatively, during an increment step, the 
controller may move to position "64-N+M", where M is a fixed value, such as 2, for 
instance. 

Further, die value of "N" determined using the cycle boundary circuit 400 may 

also be used by the quadrature loops. For example, if a quadrature loop ("Q-loop") is 

20 locked at bit position 60, and the Q-loop controller is adapted to use +/- 10 bit positions 

to create QUP and QDOWN signals, the quadrature loop might require bit positions 50 
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and 70. However, in a 64-bit delay line, the Q-loop controller will not be able to use the 
70th bit position. However, if N=40, the Q-loop may use bit positions 50 and 30. 

One problem that can arise in a DLL is that the circuits used to implement the 
MUX elements may have data dependent response characteristics, e.g. the speed of the 

5 response to a logic zero is different than the speed of the response to a logic one. FIG. 1 1 
illustrates an example of an inverter circuit 450 configured to be enabled by a differential 
enable signal that may experience a data dependent response. In the circuit 450, an 
enable signal EN, such as a line from the POWER CONTROL signal of the circuit 300 of 
FIG. 7, drives an NMOS transistor 458 while a complementary enable signal ENB drives 

10 a PMOS transistor 452. When EN and ENB are active, then a DATA signal is inverted 
by a PMOS transistor 454 and an NMOS transistor 456 to drive an output terminal OUT. 
When EN and ENB are inactive, then the inverter formed by the transistors 454 and 456 
is disabled because no current can flow between the power supply rails Vdd and Vss- 
The circuit 450 suffers from data dependent behavior because the amount of capacitance 

15 that the non-selected MUX paths contribute to the output node is a function of the data 
signal driving the non-selected nodes. For example, if a first data input IN(0) is selected, 
the devices 454 and 456 contribute one value of capacitance to the OUT node if the 
DATA(0) is low, and another capacitance value if DATA(0) is high. During the 
operation, the propagation delay thru the MUX element from the selected input to the 

20 common output will therefore vary as a function of the data states on the non-selected 

inputs, thus, resulting in a data dependent response. 
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FIG. 12 illustrates a circuit 470 that addresses the problem of data dependent 
response exhibited by the circuit of FIG. 11. In the circuit 470, a first data input IN(0) is 
input to an inverter 472 which drives one input of a NAND gate 474. An output of the 
NAND gate 474 drives an input of tri-state inverter 476. The output of the inverter 476 is 

5 coupled to a common output circuit node OUT. A second input of the NAND gate 474 is 
driven by a first enable signal EN(0) derived from the CONTROL signal received by the 
MUX. The inverter 476 receives its tri-state control from EN(0). Each input to the MUX 
has a similar circuit, as illustrated by the ellipses leading to gates 482, 484 and 486 for 
processing Nth data input IN(N) and enable signal EN(N). Via the circuit shown in FIG. 

10 12, all of the non-selected MUX paths contribute the same amount of capacitance load to 
the output node, regardless of the data values on the non-selected paths. When EN(0) is 
low, the logic gate 474 blocks any contribution from DSf(O) to the logic state at the input 
of the tri-state inverter 476. If EN(0) is low, the logic gate 474 drives the node at the 
input of the tri-state inverter 476 to a logic high, removing data dependence described in 

15 reference to FIG. 11. Alternatively, the logic gate 474 could be replaced with other logic 
elements that achieve the same functionality. 

FIG. 13 illustrates an example of a two-level 64 input MUX circuit 500 suitable 
for use in the circuits discussed above. MUX circuit 500 is constructed from a first level 
of 8-to-l MUXes 502, 504, 506, 508, 510,- 512, 514 and 516 that are driven by most 

20 significant three bits of the CONTROL signal CONTROL<5:3>. The output of each of 
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the first level MUXes is input to a second level 8-to-l MUX 520 that is driven by the 
next three least significant bits CONTROL<2:0> and a control bit <3>. 

The output of 8-to-l MUX 520 produces an output MUXJDUT for the 64-input 
MUX circuit 500. According to an exemplary embodiment, any of the 64 inputs to the 

5 MUXes 502-516 can be selected to become the MUX OUT signal. However, the 64 
input signals are not just any digital signals. Instead, they correspond to 64 taps of the 
delay line. Therefore, a signal transition at a tap "0" may occur 50pS before a signal 
transition at a tap "1", which then transitions 50pS earlier that a signal at a tap "3", and so 
on. This characteristic of the delay line, where every consecutive signal is delayed with 

10 respect to its first neighboring signal, could change if the MUX adds more error than this 
unit-delay spacing. To minimize the error introduced by the MUX, it is desirable that the 
change between any two neighboring signals introduced by the MUX is identical. 

For instance, the first eight taps of the delay line could be connected to the first 
MUX 502, and the next eight taps could be connected to the second MUX 504, and so on. 

15 In such an embodiment, if the MUX circuit 500 switches between a bit position 4 and a 
bit position 5, then the only control signal that is changing is the control code on the 
MUX 502> and the control code on the MUX 520 stays the same since the MUX 520 
reads from the same MUX 502 in the two consecutive reads. Therefore, only one MUX 
is changing in between the bit positions 4 and -5. However, in such a configuration, if the 

20 MUX circuit 500 switches between bit positions 8 and 9 (i.e., a crossover point between 

the MUXes 502 and 504), the control signals not only change on the input MUXes, but 
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also on the output MUX 520, since the MUX 520 now selects the output from the second ■ 
MUX 504. Thus, the process of changing from the bit positions 8 and 9 results in two 
changes rather than one change as in the case of switching from the bit positions 4 and 5. 
Therefore, such a configuration is not desirable. 

5 Instead, according to a preferred embodiment, the delay line taps are wrapped 

around the eight MUXs 502-516 so that the control code changes only once during any 
bit position read. The delay line taps are connected to the MUXes in such a way so that 
the bit position 1 is connected to the first input of the MUX 502, the bit position 2 is 
connected to the first input of the MUX 504, the bit position 3 is connected to the first 

10 input of the MUX 506, and so on until the bit position 8 is connected to the first input of 
the MUX 516. According to this embodiment, the next bit position, i.e., the bit position 

9, is also connected to the MUX 516. Specifically, the bit position 9 is connected to the 
second input of the MUX 516. Similarly, going up the line of MUXes, the bit positions 

10, 11, 12, 13, 14, and 15 are connected to the second inputs of the MUXes 514, 512, 
15 510, 508, 506, and 504. Similarly to the bit positions 8 and 9 being connected to the 

same MUX 516, the bit positions 16 and 17 are connected to the MUX 502. In such a 

setup, eight inputs to the MUX 502 are bit positions 1, 16, 17, 32, 33, 48, 49, and 64. 

Using the MUX setup described in the preceding paragraph, when the circuit 500 

switches between bit positions 1 through 8,- none of the control signals on the MUXes 

20 502-516 are changing, and only the control signal on the output MUX 520 is changing to 

select a different MUX for each consecutive output. When the circuit crosses from the 
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bit position 8 to the bit position 9, the control signals to the MUXes 502-516 are 
changing, but the control signal on the MUX 520 remains the same since the bit position 
8 and the bit position 9 are read from the same MUX 516. As the count increases from 
the bit position 3 to the bit position 16, the MUX 520 moves up the line from the MUX 

5 516 towards the MUX 502 by changing the control signal being input to the MUX 520. 
In all cases, only one MUX changes in between any two consecutive positions. 

As mentioned earlier, the MUXes 502-516 are driven by three most significant 
bits of the control signal (i.e., CONTROL <5:3>), and the three bits are decoded into one 
of eight values, and the decoded value selects which input is passed to the MUX's output. 

10 For example, if the three bits are "010", then the second input is passed to the output. 
Similarly, the output MUX 520 is driven by three, least significant bits, i.e., CONTROL 
<2:0>, and a control signal <3>. The extra control signal is used to invert the three least 
significant bits. If the control signal is low, no inversion is done, and bits "010" in 
CONTROL <2:0> indicate that the second input of the MUX 520 drives the MUXOUT. 

15 If the control signal is high, then the bits "010" are inverted to "101", and the fifth input 
is now selected. In such an embodiment, the control code "000111" selects the eighth 
delay line position, and the control word "001000" selects the ninth bit position. 

FIG. 14 is a logic circuit diagram illustrating an embodiment of an edge-triggered 
RS latch 550 suitable for use in the embodiments, e.g. as SR latch 130 of FIG. 1 and FIG. 

20 7. The SR latch illustrated in FIG. 14 has a transition delay of a set signal (S) to an 

output signal (Q) that is equal to a signal transition from a reset signal (R) to the Q signal. 
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In the SR latch 550, a D flip-flop 552 is clocked by a signal received at the S 
input, which then clocks a logical one from V D d to output Q of the flip-flop 552. The 
output Q of the flip-flop 552 is coupled to an input of a NOR gate 570 and further 
through an inverter 556 to an input of a NAND gate 560. Similarly, a D flip-flop 554 is 
clocked by a signal received at the R input, which then clocks a logical one from V D d to 
an output Q of the flip-flop 554, An output Q of the flip-flop 554 is coupled to an input 
of a NOR gate 572 and through an inverter 558 to an input of a NAND gate 562. The 
NAND gates 560 and 562 are cross-coupled such that the output of each is input to the 
other. Likewise, the NOR gates 570 and 572 are cross-coupled such that the output of 
each is input to the other. The output of the NAM) gate 560 drives an inverter 566 that, 
in turn, drives an output QB. The output of the NAND gate 562 drives an inverter 564 
that, in turn, drives an output Q. Similarly, the output of the NOR gate 570 drives an 
inverter 574 that, in turn, drives the output Q. The output of the NOR gate 572 drives an 
inverter 576 that, in turn, drives the output QB. The Q output drives an inverted reset 
input of the D flip-flop 554 and the QB output drives an inverted reset input of a D flip- 
flop 552, The NAND gates 560 and 562 and the NOR gates 570 and 572 preserve the 
state of the outputs Q and QB of the RS latch 550, which are determined by the signals 
input to the S and R inputs. 

The wired OR produced by the coupling of the outputs of the inverters 564 and 

574 and the interpolator such as the wired or produced by the coupling of the outputs of 

the inverters 566 and 576 accommodates the one gate delay difference introduced by the 
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inverters 556 and 558. The number of gate delays from each of the S and R inputs to 
outputs Q and QB is approximately the same, which reduces the possibility of a race 
condition arising from signals arriving at different points of the circuit at different times. 
The foregoing circuit may be readily implemented by those skilled in the art of 

5 digital circuit design, and the embodiments described are not intended to be limited to 
any particular process or fabrication techniques. For instance, a conventional synthesized 
logic design flow may be used. It should be understood that the present invention is not 
limited to the circuits, methods and systems described herein. Equivalent circuits will 
become apparent to those skilled in the art of circuit design upon review of the foregoing. 

10 The performance of various types of circuits and systems may be improved through the 
application of the teachings described herein. 

In view of the wide variety of embodiments to which the principles of the present 
invention can be applied, it should be understood that the illustrated embodiments are 
examples only, and should not be taken as limiting the scope of the present invention. 

15 For example, the circuits may employ different types of transistors and amplifiers to 
obtain the functions described above and the functions may be achieved with more or 
fewer elements that those illustrated above. Further, some elements of the embodiments 
described may be implemented in software, hardware, firmware, or a combination of 
these approaches. Further, it will be apparenHo those of ordinary skill in the art that the 

20 teachings of the present invention may be applied to other systems besides RAMBUS 
based data busses. 
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The claims should not be read as limited to the described order or elements unless 
stated to that effect. Therefore, all embodiments that come within the scope and spirit of 
the following claims and equivalents thereto are claimed as the invention. 
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